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ABSTRACT 

This paper reviews the previous work on the adaptive control of robotic systesis. 
Although the field is relatively new and does not yet represent a mature discipline, 
considerable attention for the design of sophisticated robot controllers has occurred. In 
this presentation, adaptive control methods are divided into model reference adaptive 
systems and self-tuning regulators with further definition of various approaches given in 
each class. The similarity and distinct features of the designed controllers are delineated 
and tabulated to enhance comparative review. 


1 . INTRODUCTION 

Control of robotic manipulators is a challenging problem mainly due to tne nonlinear 
and coupled nature of the system dynamics. A considerable amount of valuable work has been 
produced in the dynamic formulation and the control of these systems within the last two 
decades. Since the pioneering works of Uicker { 1 ) , Hooker and Margulies {21, and Kahn and 
Roth [31 on the formulation of dynamics, researchers have concentrated on the efficient 
computer implementation and numerical construction of the dynamic equations, while the work 
on the efficient dynamic equation algorithms is still going on, control of manipulators has 
also received significant attention. Over the years, literature on the manipulator control 
methods using optimization, linearization, nonlinearity compensation, and recently, adaptive 
technique* has become quite rich. 

This paper reviews the accumulated work in the area of adaptive control as applied to 
robotics. The reader should note that adaptive control in itself is not yet a mature 
discipline in systems .heory. Also, since some of the existing tools in adaptive control 
are strictly for linear and/or time- invar lant systems, their application to robotics 
deserves special attention. The immaturity of adaptive control is best demonstrated by the 
lack of a definition of adaptive control agreed to by the leading researchers [4]. 

According t. Webster's dictionary, to adapt means "to adjust (oneself) to new 
circumstances". Adaptive control, then, in essence, is used to mean a sophisticated, 
flexible control system relative to the conventional fixed feedback system. An adaptive 
system will assure quality system performance when large and unpredictable variations m the 
plant dynamics or loading occur. Although our aim is by no meant to establish the missing 
definition, since the robotics community seems to have reached a consensus on what is meant 
by adaptive control, we will give our definition to illustrate our interpretation of 
adaptive control. 

Definition : A feedback control system is adaptive if the gains are 

selected with the on-line information of plant outputs 
and/or plant state variables. 

This definition is depicted in block diagram format in Figure 1. The above definition 
encompasses all the previous work on the adaptive control of manipulators currently 
available to us. 
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Although the urly work on adaptive control dates back to tho 1990a, its first 
extensive application to robotics was given by Du bows ky and DesForges In 1979 (9). tines 
than a variety of dlffarant as t hods has basn developed. So far, tha existing adapt iva 
control methods applied to robotics say ba categorised under tha design of 

1. Nodal Reference Adaptive Systeas (MAAS) 

ii. Self-tuning Regulators (STR) 

The following sethods are used in the design of MAASt 

i. Local paraaetric optimisation 

ii. Lyapunov’s second sethod 

ill. Hyperstability theory 

iv. Sliding control theory 

The STR design procedure say be divided into three steps s 

i. Selection of a parametric structure to represent the robotic system via 
discste-time modeling 

ii. On-line estimation of system parameters using the least squares, extended least 
squares or maximum likelihood methods 

iii. On-line controller design based on the estimated system parameters via extended 
minimum variance or pole-sera placement techniques 

Block diagrams of MRAS and STR are illustrated in Figures 2 and 3. Note that the 
dotted boxes in these figures may be reduced to the regulator block in Figure 1. After s 
brief review of system dynamics, the related background work is presented below. 



Fiqvz a 2. Block Oiagraa of *odal ftafaranca Adaptive Syatea ftqur* 3. Block Olagr— of Self-tunine Regulator 
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SYSTEM DYNAMICS 


Dynamic equations of an n-link, n -d eg ree-of- freedom, spatial, sarial robot an with 
rigid links ars given by 


At#) I - r ( I# i) • - G ( I) I - u 


(1) 


whara #cR n is tha relative joint displacement vactor {* • (•* i*J*cR 2n )» A(t)ck n,tn - ia tha 

generalised inartia Matrix, -f - -P(ft,i)icR n raprasants tha inartia torquas dua to 

can trifocal and Coriollis accelerations, -g - -C(4)4cR n is tha gravity loads as saan at tha 

joints whara G(t)«I ,Mtn is non-uniqve, and u f R n is tha control. In state-space 
representation, Cq. (1) can ba given by 


A~ l G A -1 F 


i ♦ 


(21 


Mot a that functional dapandancias ara droppad for clarity. If aach actuator (O.C. Motor) is 
aodalad as a sacond-ordar , linear, tiMa- invariant subsyatam (neglecting tha anatura 
inductanca) , and is couplad with tha Manipulator dynasties, tha praviously dafinad stata 
vactor, x, will ba preserved and tha control will ba tha actuator input voltage. In this 
casa, Eq. (2) takas tha following fon 
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whara a • A ♦ J is tha cowbinad inartia matrix with J • diagfJ ], J ic tha rotor inartia of 

k k 

tha k th actuator rafarrad to output shaft, E ^ , Ej and Ej ara diagonal, positiva dafinita 
constant matrices and functions of various actuator/gaar train paraaatars. 

Although most of tha works do not includa tha actuator dynamics, the abova, 
simplified form may ba substitutad, sinca tha form of tha aquations remains tha cam . 
Depending on tha adaptation algorithsi, thasa constant actuator paraaatars may aithar ba 
includad in tha on-line idantif ication schama or assuaad known. In our prasantation, tha 
ganaric u will raprasant tha suitabla control (aithar tha affactiva input torquas or tha 
voltages j . Tha only axcaption is (6] whara third-order actuator dynaaics is studiad in 
addition to tha abova siaplifiad form. Tha dynamic equations, Eq. (2) or (3), may ba givan 
in tarns of tha robot-hand coordinatas axprassad in a fixad rafaranca frama (task-oriented 
coordinates ) and adaptive controllers may be designed for this systesi (6,?, 8]. 


3. KRAS-BASED CONTROLLERS 

In KRAS design, usually a sacond-ordar , linear, time-invariant, continuous- time 
rafaranca modal is selected for aach link of tha sarial robot. Than, a control law is 
derived to force tha robot to behave like tha selected modal. As mentioned earlier, local 
parametric optimization (5,9], Lyapunov's second method [10], hyperstability (11,12,13], or 
tha sliding control theory [14,15,16] is usually employed to achieve tha goal. 

In 1979, Dubowsky and DesForges [S] implemented tha local parametric optimization 
technique on a robot arm. In their formulation, aach servomechanism is modeled as a 
sacond-ordar , single-input, single-output system, neglecting the coupling between system 
degrees of freedom. Than for aach degree-of- freedom, position and velocity feedback gains 
ara calculated by an algorithm which minimizes a positiva sami-daf inita error function 
utilizing tha steepest descant method. Stability is investigated for tha uncoupled, 
linearized system model. This work represents the first implementation of adaptive control 
to robotics. 
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The recent works hsvs concentrated on the designs based on ths Lyapunov's second method 
and the hyperstability theory. In the most general case, thasa control methods yiald tha 
following control structura u * 


u p ‘ Vl'WVV * Vi'WW * 4 3 f 3 J '*p ' x r ' *r* 


where subscripts p and r raprasant tha plant and rafaranca model, respectively, 5 ^ is aithar 
0 or 1 ( K^cP nxn nonlinaar or constant gain matrix, 1*1,2, or 3, a^cR* raprasants an 

unknown systaa paraaatar Ilka tha payload, link mass contant, cantar of gravity location, 
ate., vhara a coablnation of thasa constant paraaatars or a nonlinaar tara is to ba 
astiaatad in tha adaptation proeass, and j»l,2,...,k, vhara k dapands on tha spaclfic 
controller design. Although some controllers call for plant joint accelerations, they are 
not shown in Cq. 14) . 

Tha first tans f, in Cq. (4) describes tha nonlinearity coapensation. It may 

represent tha complete manipulator dynamics as in (17,16), or only tha gravity terms and tha 
Jacobian as in (7). a controller with « 1 and 5^ • * 0 indicates only a nonlinearity 

compensation. Tha second term in u p represents the feedback portion of tha controller. Tha 
gain matrices may either be nonlinaar or constant. No v • 0 and 5^ • 1 represent 

tha control structures of (16,19,20,21) among others. Tha third term in u p includes tha 
portion of tha control where systaa parameters are estimated (17,18,22). Slotina (18), for 
example, includes all tha components Into his controller, therefore, 6^ * Jj • Jj - 1. 
Takegaki and Arimoto's control strategy (7) may ba susttarized by ^ • 1 and - 0, 

Horowitz and Toaizuka’s (22) * 0, * 1, etc. 

Various MRA5 control structures are summarized in Table 1. This table differentiates 
tha methods which require explicit calculation of dynamic equations (Nonlinearity 
Compensation) from tha methods which adaptively estimate tha plant parameters on-line 
(Incorporation of Plant Parameter estimation). However, further distinction is needed in 
tha latter group, since while one approach explicitly identifies the nonlinear terms (as ia 
A, G, and r with reference to Cq. (2)), and estimates them on-line, tha recant methods treat 
the constant robot parameters as unavailable, estimate and compensate them in their 
algorithms. Some methods choose nonlinear feedback matrices in their controllers (H, L, S 
in Table l) without incorporating the explicit system parameter* estimations . 


The early works presented in Table 1 have generally avoided the nonlinearity 
compensation and opted for the assumption that the nonlinear system parameters vary slowly 
in time. On this basis, a stability analysis is given for the system. This assumption 
almost certainly is too restrictive, since the nonlinear manipulator system parameters are 
functions of the jcint position and velocities. The faster the robot movement is, the more 
rapidly the system parameters will vary. The objective on the other hand, for the more 
sophisticated control methods is to enable fast robot movements with high precision. As a 
result of revolutionary advances in the microprocessor industry with prices steadily coming 
down, the possibility of real-time implementation of computation intensive algorithms is 
steadily improving. Recently, Wander and Tesar (23,241 have implemented the complete 
dynamic equations [251 of a 6-link, general architecture robot arm in 6.5 milli sec. (150 
Hz) in explicit form without using recursion. Their algorithm is able to treat an 
n-degree-of-freedom (DOF) serial system of completely general parameters (43 ailli sec. for 
l 2 -DOF) . They have implemented the algorithm on an Analogic AP-500 array processor. 

Further work on the comparative analysis of various computer architectures is underway at 
the University of Texas at Austin. 


Some of the most recent works include nonlinearity compensation along with a feedback 
portion and parameter identification features (6,17,181. Once the control has the form » 

. ? 

A (9 )u* where subscript p denotes the plant, A_ is the on-line calculated generalized 
P P P P 

inertia matrix and is yet to be selected, generally, global stability of the closed-loop 
system can fce shown provided that where I is the identity matrix of order n, is 

maintained (6). Otherwise, in reference to Table 1, all methods without nonlinearity 
compensation need to assume system parameters stay constant during the adaptation. 
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Table 1 . Summary of MRAS Controllers in the Literature 



1 : Calculates c omplete or partial nonlinaar dynamics on-line. 

; : Robot link lengths, mast corrtanta, actuator parameters, ate., il not otherwise specified. 

3 : If "yes", stability anaiyaia baaad on this aaaumption. 

G : Gravity load companaatad; alao raquiraa on-lina Jacobian calculation. 

A : Raquiraa only tha on-iina calculation of tha inartia matrix. 

0 : Nonlinear-gain taadback ualng local paramatrie optimisation. 

C : Conatant-gain faadback. 

L : Nonlinaar-gain faadback ualng Lyapunov'a second method. 

H : Nonlinaar-gain faadback uaing hyrerstablllty thaory. 

S : Nonlinaar-gain faadback uaing adding thaory. 

N : Structure of nonlinaar syatam paramatara (funetlona of state variablaa) are explicitly assumed 
known and are adaptively estimated; stability analysis based on hyperstability thaory. 
v : Yea. 

- : No. 
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Bales trino, ft al. (19] have developed an adaptive control lar which produces 
discontinuous control signals leading to chattering. Stability analysis is presented using 
hyperstability theory. In (16] , Balestrino, et al. present three methods; the first is 
based on the theory of variable-structure systems, the second on the hyperstability and the 
third is a combination of the first two methods. Designed controllers produce 
high-frequency chatter which is highly undesirable since higher order dynamic modes may be 
excited. Numerical simulations show an extremely high frequency of sign switches in the 
plant input, prohibiting its physical realization. Stoten [21] formulates the MRAS problem 
closely following the procedures in (9) and simulate the algorithm for a 1-link manipulator. 

Horowitz and Tomizuka [22] study the adaptive control of a 3-link arm. Gravity effects 
and the mass and inertia of the first link are neglected. Each nonlinear term in the 
dynamic equations is identified a priori, treated as unknown, and estimated by an adaptation 
algorithm. For the modeled system and the designed controller, stability analysis is given 
by Popov's hyperstability theory. Later, Anex and Hubbard [26] have experimentally 
implemented this algorithm with some modifications. System response to high speed movements 
is not tested, but practical problems encountered during the implementation are addressed in 
detail. 

Takegaki and Arimoto [7] propose an adaptive method to track desired trajectories which 
are described in the task-oriented coordinates. The suggested controller compensates 
gravity terms, calculates the Jacobian and the variable gains, but does not compensate the 
manipulator dynamics completely. System stability is assured if the manipulator hand 
velocity is sufficiently slow, i.e., nonlinear system parameters change slowly. 

Nicosia and Tomei [27] derive control laws using the hyperstability theory to follow a 
linear, time-invariant reference model. The plant (manipulator) parameters and the payload 
are assumed known and are not identified. Their controller does not produce chattering and 
Is relatively easy to implement. Lim and Eslami (20] propose controller designs based on 
Lyapunov's second method. The author's objective is to control the linearized dynamic 
equations with the developed controllers; hence, assuring the stability of the linearized 
system. Later, nonlinearity compensation is suggested to enhance the system response. 

Whyte [281 designs an adaptive controller via Lyapunov's second method. The algorithm does 
not require any knowledge of the manipulator dynamics and selects nonlinear gains in the 
feedback loop to follow the reference model. System stability is shown, provided that the 
parameter changes are slow. Hsia [29] reviews the current methods used in adaptive control 
and gives brief formulations for each method. Vukobratovic , et al. [30], review local 
parametric optimization and hyperstability-based methods and choose not to include the 
approaches based on Lyapunov's second method in their book on the non-adaptive and adaptive 
control of manipulators. 

Tosunoglu and Tesar [6] select a generalized nonlinear reference model which represents 
ideal robot dynamics. The plant, the actual robot whose system parameters may not be 
exactly known, is then forced to behave like the reference model to follow the desired 
tra}ectory. The advantage of the nonlinear reference model is that the adaptation process 
ceases once the nominal trajectory is recovered. (Such is not the case when linear models 
are selected.) Error-driven dynamics is derived and the system is augmented to include the 
integral feedback feature to eliminate the parameter discrepancies between the plant and the 
reference model, and the disturbances acting on the system. It is shown that the 
controllers designed in this work via Lyapunov's second method also produce hyperstable 
systems. Simulations demonstrate successful trajectory tracking on 3- and 6-link, spatial 
manipulators under unknown payloads and estimated system parameters (link lengths, masses, 
inertia components, payload, etc.). The authors also provide comparative analyses of the 
effect of integral feedback and various controller update rates, 60 to 200 Hz. 

Craig, Hsu and Sastry [17] take an interesting approach in designing an adaptive 
controller using the Lyapunov's second method. In this work, the structure of the terms in 
the dynamic equations is assumed known, but their numerical values remain unknown. They 
partition the dynamics into known and unknown portions and estimate the unknown parameters 
along with compensation for the nonlinearities. Global stability is proved by assuming that 
a matrix function of the plant joint position, velocity and accelerations is bounded. 
Although all the terms which are functions of positions are bounded, velocity and 
accelerations may increase without bounds; thus, making the matrix unbounded. However, 
modif ications in the controller structure may alleviate this problem. Their numerical 
simulations identify link masses and Coulomb friction coefficients for a two-link 
manipulator with encouraging results. 


210 


8 lot ins and Li [18] derive a control law with full faadforward dynamics compensation, 

PD feedback and on-line payload and manipulator parameter estimation using Lyapunov's second 
method. Since this control scheme does not eliminate the steady-state errors, the authors 
restrict the steady-state position errors to lie on a sliding surface. This modification, 
in turn, causes the loss of numerical efficiency where. Interestingly, the authors make use 
of the recursive computation feature of the manipulator dynamics. Later, an approximate 
implementation is suggested to improve the numerical efficiency. Payload parameter 
Identification is simulated on a two-link manipulator. 

Once these current methods are refined, application to manipulators with higher degrees 
of freedom will naturally follow. Determination of the structure of the constant terms (for 
identification) for manipulators with higher number links may be achieved with symbolic 
generation of dynamic equations, but the effect of increased number of terms will require 
further investigation. 


4. SELF -TUNING REGULATOR (STR) BASED CONTROLLERS 

In this method, typically, nonlinear manipulator dynamics is linearized about a nominal 
trajectory and then discretized. The discretized model gives the structure of the 
parametric model whose parameters need to be estimated on-line. The parametric model is 
given by the following multivariable difference equation 

y (k) - 0 T 4 (k-1) + e(k> (5) 


where y(k)cR n is the system output, k is the sampling time counter, QcR nx * 2nm+1 * contains 
the parameters to be identified, ♦cR (2n ®* 1 * represents the combined system input and output 

vector, ecR n is a random, zero-mean Gaussian white noise, and m is the order of the 
estimation model. 

Parameter estimation is based on the system identification techniques using the sampled 
input-output data. Although such techniques include the least squares, extended least 
squares and maximum likelihood methods; the recursive least squares method is extensively 
used because of its lower computational requirements [8,25,29-36]. The recursive least 
squares estimation is given by 


e.(k) « 8 i (k-1) + P(k)4(k-1) (y i (k) - 0*(k-l) ♦ (k-1) ] (6) 

where 

poo - 4-t poc-h - , 

A x ♦ ♦ik-l) P (k-1) * (k-1) 

e i (k) represents the estimate of the i^ row of 9 defined in Eq. (5), P(k) is a square 
symmetric matrix of order (2nm+l), and 0 < A < 1 is an exponential forgetting factor. 

Once the parameters are identified at each sampling time, regulator parameters are 
estimated using the extended minimum variance [8,30,32], or pole-zero placement techniques 
[29,33,35]. The method described above is known as explicit or indirect STR. If the 
regulator parameters are estimated directly by a reparameterization of the process model, 
the model is called implicit or direct STR. Usually implicit STR algorithms cancel all 
process zeros making them suitable only for minimum phase systems. 

Koivo and Guo [32] assume an autoregressive model and identify system parameters using 
the recursive least squares technique. They design an extended minimum variance controller 
for the estimated model. The method chooses a quadratic performance index in terms of the 
state error vector and the system control vector and minimizes it relative to admissible 
controls while satisfying Eq. (5) . Their simulations Include decoupled and partially 
coupled parametric model structures. They report that the parameter convergence is faster 
in the decoupled case, and that no significant Improvement in the system response is 
observed for the coupled model. This is rather interesting, because the amount of on-line 
calculations is considerably reduced for the decoupled case. Also reported is the fact that 
the model and the controller parameter identifications may not converge fast enough while 
the robot motion takes place. 
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Lee [8] derives the perturbation equations, discretizes then and estinates the systen 
parameters using the recursive least squares method. Then an adaptive controller is 
designed using the extended minimum variance technique. The parameter identification 
requires the estimation of 6n J parameters on-line (216 for a 6-link manipulator). Lee 
provides a detailed breakdown of the computational requirements and concludes that for a 
6-link manipulator the control scheme can be updated at about 56 Hz on a PDP-11/45. 

Hsia [29] reviews the STR formulation on a decoupled model. Kamik and Sinha (35 J 
develop an STR based on a non-minimum phase model which assigns the system poles while 
retaining all the zeros. This algorithm is developed for a UNIMATE-2000 robot. Landau 
[9,36] and Vukobratovic, et al. (30] review various STR designs in detail. 

In general, discrete-time formulation is especially suitable for computer control. 
However, on-line estimation of all system parameters and the control design make STR 
computationally intensive. Astrom [4] reports that numerical estimation techniques tend to 
be numerically unstable as the number of parameters increases in the system model. In this 
case, the complete system is parameterized. However, the papers reviewed in this work do 
not raise the question with regard to numerical instability although they do indicate the 
importance of initial parameter selections. In STR methods, convergence of estimated 
parameters in the adaptation process is guaranteed if the system parameters are constant. 
Since the actual robot model parameters are nonlinear functions of the state vector, the 
question of system parameters varying slowly in time again arises in the STR methods. 


5. CONCLUSIONS 

Adaptive control of robotic systems has received significant attention within the past 
few years. A class of control laws based on the MRAS design realize the adaptation through 
signal synthesis with a completely known parameter structure, while some methods select a 
subclass of the parameters for identification for reduced computational burden. All 
adaptive controllers via STR design and some MRAS-based methods estimate the complete 
(nonlinear) system parameters on-line. 

Stability analysis usually relies on the condition that the nonlinear system parameters 
vary slowly. This condition is removed if a nonlinearity compensation component is also 
incorporated i the controller. The most recent works, which exploit the special structure 
of manipulator dynamics, seem to favor this feature. The use of state-of-the-art 
microprocessor technology along with the sophisticated dynamic formulation algorithms 
strongly indicate that real-time implementation of dynamic compensation is rapidly becoming 
feasible. 

Further research to perfect the existing algorithms and to provide rigorous stability 
proofs, which will improve the maturity of the adaptive control, is still needed. Although 
today's industrial robots employ linear controllers to accomplish a number of useful tasks, 
fast and precise robot movements remain to be implemented. Development of laboratory test 
beds and implementation of the developed adaptive controllers on robotic manipulators are 
also crucial, since only after successful demonstrations will technology transfer be 
possible. 
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